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Abstract. We discuss a topological approach to words introduced by 
the author in |Tu2 | - |Tu4| . Words on an arbitrary alphabet are approx- 
imated by Gauss words and then studied up to natural modifications 
inspired by the Reidemeister moves on knot diagrams. This leads us 
to a notion of homotopy for words. We introduce several homotopy in- 
variants of words and give a homotopy classification of words of length 
five. 



1. Introduction 

Words are finite sequences of symbols, called letters, belonging to a given 
set a, called an alphabet. In these lectures we discuss an approach to com- 
binatorics of words based on their analogy with curves on the plane. To 
begin with, consider Figure H depicting a plane curve with distinguished 
origin and orientation. The three crossing points of the curve are labeled 
by letters a,b,c € a. Now, starting at the origin we move along the curve. 
The first crossing met is labeled a, the second one b, and so on. Finally we 
return to the origin and stop. Writing down the labels of the crossings as 
we encounter them, we obtain the word abcabc. This procedure, deriving a 
word from a closed plane curve with labeled double transversal intersections 
was introduced by Gauss \Ga\ in his attempt to classify such curves. Clearly, 
every letter appears in the resulting word twice. Words in which every letter 
appears twice are called Gauss words. 

It is easy to see that not all Gauss words can be realized by closed plane 
curves. The word abab for instance is not realizable by such a curve — see 
Figure IS Various conditions on Gauss words necessary and sufficient for 
their realizability by closed plane curves were obtained by several authors, 

see (Mil, |LMl> IE3> EH, Ewl, [CEl, IDEl- 

The aim of these lectures is to study arbitrary words using ideas taken 
from the topology of curves. To do this, we generalize the above picture 




Figure 1: A plane curve associated with the word abcabc 




Figure 2: The word abab is not reahzable by a closed curve on M? 

in the following three directions. First of all, we allow curves with self- 
crossings of arbitrary multiplicity > 2. For example, the word associated 
with the curve in Figure is ababa. In general, the number of occurrences 
of the label of a crossing in the associated word is equal to the multiplicity 
of this crossing. To handle letters appearing only once, we may distinguish 
a finite set of generic points on the curve and label them as well; we shall 
not do that here. 



Figure 3: A curve with a triple point yielding the word ababa 

The second direction in which we may generalize is to allow unlabeled 
or virtual crossings. Such crossings do not contribute at all to the associ- 
ated word. For example, the curve in Figure |3 gives rise to the word abab. 
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The idea of unlabeled crossings is inspired by the theory of virtual knots 
introduced by L. Kauffman |Kaj . 




virtual crossing 



Figure 4: A curve with an unlabeled crossing 

Thirdly, we may allow the same label to be used on different crossings. In 
Figure [51 there are three crossings Ai, A2, A^, all labeled by the same letter 
a € a. This leads us to so-called etale words that are words in an alphabet 
projecting to the given (fixed) alphabet a. The curve on Figure |S1 gives rise 
to the etale word ^i^2^3^i^2^3 where the letters Ai,^2)^3 project to 
a £ a. This etale word is a Gauss word in the alphabet {^1,^2)^3}; we 
call such etale words nanowords. 




Figure 5: A curve with label a on different crossings 



The general scheme of the topology of words is as follows: arbitrary words 
on the given alphabet a are approximated by nanowords and the latter are 
studied by methods inspired by the topology of curves. It is certainly in- 
teresting for topologists to apply topological methods to study such new 
objects. Our approach also leads to new questions concerning combina- 
torics of words. The accent in this theory shifts from words themselves to 
transformations of words, inspired by topology. The situation is similar to 
the one in knot theory where one focuses on isotopy classes of knots rather 
than on specific knot diagrams. 
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The present exposition follows my lectures given in the Research Institute 
for Mathematical Sciences (RIMS, Kyoto) in February 2006 and is based on 
my papers |Tulj - [Tu4| . The lectures were organized by Prof. Tomotada 
Ohtsuki. The lecture notes taken by Eri Hatakenaka, Daniel Moskovich and 
Tadayuki Watanabe served as the basis for this paper. I would like to express 
my gratitude to Prof. Ohtsuki for inviting me to RIMS and for organizing the 
lectures and to Eri Hatakenaka, Daniel Moskovich and Tadayuki Watanabe 
for the preparation of the notes. 

2. Words and nanowords 

In this section we give formal definitions of words, etale words, and 
nanowords. Fix a set a called the alphabet. 

2.1. Words. A word of length n > 1 on a is a mapping 

w: n ^ a, 

where n is the set {1, 2, . . . , n}. That is, a word of length n on a is a sequence 
of n elements of a. For example, the word w = aba, with a,b £ a, is nothing 
but the map 

w : {1, 2, 3} — > a, 

defined by w{l) = a, w{2) = b, and w{3) = a. By convention, there is one 
empty word (j) of length 0. 

The opposite word to a word w: n — > a, is denoted by w~ and defined 
by w~{i) = w{n + 1 — i) for all i £ n. For instance, \i w = abca, then 
w~ = acba. 

Concatenation of two words is defined by writing down the first word and 
then the second one. For example, the concatenation of the words w = abc 
and V = dbba on a is the word wv = abcdbba. 

One more operation on words is a change of the alphabet. For a map 
/ : a — > a' from an alphabet a to an alphabet a' and a word w: n — > a, set 
f^{w) = f o w. This is a word on a' of the same length n. For example, if 
w = abca, then f^{w) = f{a)f{b)f{c)f{a). 

2.2. Etale words. An a-alphabet is a set A endowed with a mapping to a, 
called the projection. The image in a of any A A will be denoted \A\. 
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A morphism of a-alphabets Ai and A2 is a mapping f : Ai ^ A2 such 
that 1^1 = 1/(^)1 for any A Ai- This means that the diagram 



Ai ^ A2 




commutes. An isomorphism of a~alphabets is a bijective morphism. 

An etale word over a is a pair (an a-alphabet A, a word in the a- 
alphabet A). In particular, every word on a becomes an etale word over 
a by regarding a as an a-alphabet A = a with projection to a being the 
identity map. Another example: let a G a and A = {Ai, A2, A^} with 
1^1 1 = 1 = 1^3 1 = a. The pair (A, A1A2A3A1A2A3) is an etale word over 
a. It corresponds to the picture on Figure |S1 

Two etale words {Ai,wi) and (A2,W2) over a are isomorphic if there 
is an isomorphism /: ^1 ^ A2 such that ^2 = f(,iwi). The relation of 
isomorphism will be denoted ~ . For example, if B = {Bi, B2, B^} is an 
a-alphabet with = |i?2| = I-B3I = a € a, then 

iB,BiB2B3B,B2Bs) « {A, AiA2A3A,A2As) 

where the etale word on the right is as in the previous paragraph. 

For an etale word {A,w), the opposite etale word is defined by {A,w)~ = 
{A,w~). The product of etale words {Ai,wi) and {A2,W2) over a is defined 
as follows. If AinA2 = 0, then the pair (^1 U A2,wiW2) is an etale word 
over a, and we call it the product of (^1, and {A2, W2)- If ^1 fl ^2 7^ 0, 
then we pick an etale word {A'l , w'l ) over a isomorphic to (^1 , wi ) and such 
that A'l r\A2 = 4>- We call then the etale word {A'l UA2, w[w2), the product 
of {Ai,wi) and {A2,'W2)- The product of etale words is well defined up to 
isomorphism. 

Beware that concatenation of words on a usually differs from multi- 
plication of the corresponding etale words. For example, for the words 
wi = abb and W2 = aa on the alphabet a = {a, 6}, the correspond- 
ing etale words are (^1 = {A,B},ABB) with \A\ = a,\B\ = b, and 
(A2 = {A',B'},A'A') with \A'\ = a,\B'\ = b. Their product is the etale 
word {{A,B,A',B'},ABBA'A'), with \A\ = \A'\ = a,\B\ = \B'\ = 6, while 
the etale word corresponding to W1W2 = abbaa is {{A,B},ABBAA) with 

\A\ = a, \B\ = b. These two etale words are not isomorphic. 
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2.3. Nanowords. A word u; on a finite alphabet is a Gauss word if every 
letter of this alphabet appears in w exactly two times. For instance, the 
words AABB, ABAB, ABBA, BAAB, BAB A, BBAA, are all Gauss words 
on the alphabet {A, B}. The words AA, ABA are not Gauss words on this 
alphabet. 

An etale word (A, w) is a nanoword over a if w is a Gauss word on A. 
Then the alphabet A is finite and the number of its elements is equal to the 
half of the length of w. For example, let A = {A, B} with |^| = a € a and 
\B\ = b €z a. The etale word {A, ABAB) is a nanoword. Another example: 
A = {A,B,C} with l^l = a € a, \B\ = b ^ a, and \C\ = c (z a. Then the 
etale word (A, ABCBCA) is a nanoword. 

Two nanowords {Ai,wi) and {A2,W2) are isomorphic if they are isomor- 
phic as etale words. 

We now make a few simple remarks about nanowords. If {A, w) is a 
nanoword then its opposite {A,w)~ is also a nanoword. The concatenation 
of two nanowords is a nanoword. An empty etale word {A = (j),w = (p) is a 
nanoword. The set of nanowords over a is infinite provided a ^ (p. 

Note finally that a plane curve with fixed origin and labeled crossings 
gives rise to a nanoword if and only if this curve is generic — that is all its 
self-intersections are double transverse crossings. Thus, the nanowords can 
be thought of as combinatorial analogues of generic curves. 

2.4. Desingularization. Consider again the curve on FigureOl By a small 
deformation near the triple point, we obtain a generic curve whose singu- 
larities are double points A12, A2^3, Ai^^, B shown on Figure IHl We label 
the points ^1^2,^2,3,^1,3 by a and the point B by b. Thus, each crossing 
of the deformed curve has the same label as the corresponding crossing of 
the original curve. The left curve on Figure El gives the word w = ababa. 
The right curve gives the Gauss word w"^ = 741^2^1,3-8^1^2^2,3-6^1^3742,3 
in the a-alphabet {Ai^2, ^2^3, A3, B} with |74i,2| = |^i,3| = 1^2, 3I = cl and 
\B\ = b. We view the nanoword w'^ over a as the desingularization of the 
word w = ababa. 

The desingularization procedure considered in this example can be gen- 
eralized and leads thus to desingularization of arbitrary etale words. More 
precisely, for every etale word {A, w) over a, we define a nanoword {A'^, w'^) 
over a called its desingularization. For any A £ A, the multiplicity mu){A) 
is the number of times that A occurs in w. For instance, mABBiA) = 1 and 
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Figure 6: Desingularization of a curve 



mABBiB) = 2. We define the set A'^ by 

A'^ = {{A,i,j) \ A£A, l<i<j< myj{A)}. 

Denoting {A,i,j) by Aij, we define the projection A"^ ^ a by \Aij\ = \ A\ G 
a. This makes A'^ into an a-alphabet. Here every letter of multiphcity 
m > 1 in w gives rise to letters in A^. The word w'^ on the alphabet 

A'^ is defined in two steps. 

Step 1. Delete from w all letters of multiplicity 1. 

Step 2. For each A & A with mw{A) > 2 and for each i = 1, 2, . . . , m^(^), 
we replace the i-th entry of ^ in w by 

Al,i A2^i ■■■ Ai_ii Aii^i Ai^i^2 ■■■ ^i,miu{A)- 

The resulting word w'^ on A"^ is a Gauss word and the pair {A'^,w'^) is 
a nanoword over a. For example, for the etale word {A = {A,B},w = 
ABABA) with \A\ = a, \B\ = b, this procedure gives the nanoword 

{A'^, W"^) = {{Ai^2, ^2,3, ^1,3, ^1,2}, ^,2^1,3^1,2^1,2^2,3^1,2^1,3^2,3) 

with \Ai^2\ = 1^2.3! = 1^1,3! = and I-B1.2I = b. 

We can say that the desingularization of curves is based on viewing the 
crossings under a strong microscope which allows us to see the "internal 
structure" of each crossing. This analogy suggested the term nanoword. 

3. Knot theory and homotopy of nanowords 

3.1. Knot theory. We shall study nanowords using the analogy with curves 

and knots. Recall the classical Reidemeister moves on knot diagrams in M?, 

shown in Figure [7| (The inverse moves are also called the Reidemeister 

moves.) These moves and the isotopy of knot diagrams in M? generate an 

equivalence relation on knot diagrams which we call the R~equivalence. We 
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have 



{knots in M^}/ isotopy = {knot diagrams}/ R-equivalence. 

This fundamental equahty, due to K. Reidemeister, reduces the study of 
isotopy classes of knots to a study of R-equivalence classes of knot diagrams. 




Figure 7: Reidemeister moves on knot diagrams 



Similar moves can be considered on pointed curves. For the sake of the 
following discussion, we switch to curves — that is we make no distinction 
between under-crossings and over-crossings. We always assume that the 
moves act away from the origins of the curves. 

Let us look at the effect of the Reidemeister moves on words associ- 
ated with curves. The first Reidemeister move, shown in Figure |S1 acts 
as xAAy i— > xy, where x and y are words not including the letter A. 

Consider the second Reidemeister move with labels, orientations, and the 
position of the origin as in Figure 1^1 The move acts on the associated word 
as xAByBAz i— > xyz where x, y, z are words not including the letters A^ B. 
For another choice of orientations, the move may act as xAByABz i— > xyz. 
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Figure 8: First Reidemeister move on a labeled curve 



The first version xAByBAz ^ xyz is sufficient for our aims as will be clear 
from the results below. 




Figure 9: Second Reidemeister move on a labeled curve 



Consider the third Reidemeister move with labels, orientations, the posi- 
tion of the origin, and the order of branches as in Figure [TUl This move acts 
on the associated word as xAByACzBCt ^ xBAyC AzCBt where x, y, z, t 
are words not including the letters A,B,C. For other choices of orientations, 
order of branches etc., the move may act differently but the version shown 
on Figure [TUl is sufficient for our aims. 




Figure 10: Third Reidemeister move on a labeled curve 
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3.2. Homotopy of nanowords. Let a be an alphabet (a fixed set). We 
fix homotopy data consisting of an involution r: a — > a and an arbitrary 
set SCa^ = axaxa. The geometric meaning of r and S will be dis- 
cussed in the next section. In the context of knot diagrams, r switches 
between positive and negative crossings. The role of S is to determine the 
transformations of labels accompanying the third Reidemeister move. 
Three homotopy moves on nanowords over a are defined as follows. 

(1) (AjxAAy) {A — {A},xy) where x,y are words on the alphabet A — 
{A}. Note that if {A,xAAy) is a nanoword, then so is {A — {A},xy). 
The inverse move adds a new letter A to the a-alphabet and inserts A A 
into the word. 

(2) {A,xAByBAz) i-^ - {A,B},xyz) provided |^| = t{\B\) and x, y, z 
are words on the alphabet A — {A, B}. 

(3) {A,xAByACzBCz) ^ {A, xBAyCAzCBt), provided {\A\,\B\,\C\) £ 
S and X, y, z, t are words on the alphabet A — {A, B, C}. 

Two nanowords over a are S-homotopic if they can be related by a finite 
sequence of homotopy moves, inverse moves, and isomorphisms. We denote 
this equivalence relation by ~5 and call it S-homotopy. This definition 
readily extends to etale words: etale words wi and W2 are S-homotopic if 
wf ~5 In particular, the notion of iS-homotopy applies to words on a. 

We will use the following notation: 



Clearly, Af{a, S) is a monoid, with the empty nanoword as its unit element 
and concatenation as its product. This monoid depends on r which is how- 
ever omitted in the notation M{a, S) to make it shorter. 

The following two lemmas show that our three moves generate a wider 
set of similar moves. In the context of Figures |H1 and IIUI the new moves 
correspond to other choices of orientations, branch connections, etc. 

Lemma 3.1. Let A,B,C be three distinct letters in an a-alphahet A and 
let X, y, z, t be words in the alphabet A — {A, B, C} such that xyzt is a Gauss 
word in this alphabet. Then we have the following S -equivalences: 



Af{a,S) = {set of nanowords over a} /5-homotopy. 



(1) 



{A, xAByCAzBCt) -5 {A, xBAyACzCBt) 



if{\A\,T{\B\),\C\)(^S, 



(2) 



{A, xAByCAzCBt) -5 {A, xBAyACzBCt) 
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if{T{\A\),T{\B\),\C\)eS, and 

(3) {A, xAByACzCBt) -5 {A, xBAyCAzBCt) 

zf{\A\M\B\)M\C\))^S. 

Lemma 3.2. Suppose that Sn{axbxb) ^ cf) for allb G a. Let {A,xAByABz) 
be a nanoword over a where A,B £ A with \ A\ = t{\B\) and x, y, z are words 
on the alphabet A — {A, B}. Then 

(A, xAByABz) -5 {A - {A, B}, xyz). 



Proof. Set b = \B\ £ a. By assumption, there is e G a such that (e, 5, b) G S. 
Pick a letter E not belonging to A and set = r(e) G a. Then 

(Move 

{A, xAByABz) -5 U {E}, xAEEByABz) 



~5 



{A U {E},xEABEyBAz) 



(Move 2) 

~5 {Au{E}-{A,B},xEEyz) 

(Move 1) 

In the second line we use the 5-homotopy of Lemma I3.1l ('2l. where A 
is replaced by E, B by A, and C hy B. This homotopy applies since 
{T{\E\),T{\A\),\B\)={e,b,b)eS. □ 

3.3. Typical questions. In analogy with knot theory, the main objective 
of the homotopy theory of words is to classify etale words and nanowords up 
to 5-homotopy. Putting it differently, the goal is to compute the monoid 
M{a,S) at least for some choices of a, r, and S. We are very far from 
reaching this goal. Available results are outlined in the rest of the paper. 

Taking knot theory as a model, we list here several typical questions 
concerning the homotopy of words. 

(Ql) Is a given nanoword 5-contractible, i.e., 5-homotopic to the empty 

nanoword ? This question corresponds to the question of whether or 

not a given knot diagram presents an unknot. 
(Q2) Is a given nanoword w homotopically symmetric, that is 5-homotopic 

to ? Note that opposite words corresponds to knots with opposite 

orientations. 
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(Q3) Define the length norm of an etale word w by 

ll^lls ~ 2 length of a nanoword 5-homotopic to w). 

Note that = if and only if w is contractible. For any etale 

words wi,W2, 

\\wiW2\\s < Ikllls + lk2||<S- 
Compute the length norm. 

4. Curves and knots as nanowords 
In this section we clarify the relations between curves, knots, and nanowords. 

4.1. Curves as nanowords. In the sequel, the word "curve" means the 
image of a generic immersion of an oriented circle into an oriented surface. 
Here "generic" means that the curve has only a finite set of self-intersections, 
which are all double and transversal. The curve may be immersed into any 
oriented surface, of any genus, compact or not, with boundary or not. Note 
that all self-intersections of a curve look locally like \ . Triple points and 
self-tangencies are not allowed. Every curve has a regular neighborhood. 
This is a narrow band around the curve inside the surface, see Figure 1111 
Note that the orientation of the ambient surface induces an orientation of 
the regular neighborhood. 

A curve is pointed if it is provided with a base-point which is not a self- 
intersection. An example of a pointed curve is drawn on Figure ITTI 




Figure 11: A curve on a surface of genus two 

Two pointed curves are stably homeomorphic if their regular neighbor- 
hoods look exactly the same including the position of the curves in these 
neighborhoods. Here is a more precise definition. 
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Definition 4.1. Two pointed curves are stably homeomorphic if there is an 
orientation-preserving homeomorphism of their regular neighborhoods map- 
ping the first curve onto the second one and preserving the origin and the 
orientations of the curves. 

The stable homeomorphism class of a curve is determined solely by the 
germ of the ambient surface near the curve; what happens outside a regular 
neighborhood does not matter. In particular, adding handles and punctures 
to the surface away from a neighborhood of the curve does not change its 
stable homeomorphism class. 

Recall the definition of the stable equivalence of curves from |KKj . |CKSj . 

Definition 4.2. Two pointed curves are stably equivalent if they can be 
related by a finite sequence of the following transformations: 

(1) Stable homeomorphism. 

(2) Homotopy of the curve in its ambient surface away from the origin. 

The homotopy in (2) may push a branch of the curve across another 
branch or across a double point but not across the origin of the curve. 

Pointed curves related by any sequence of moves (1), (2) are stably equiv- 
alent. Thus, we may start with a curve, transform it by a stable homeomor- 
phism, deform the resulting curve, add handles, deform again, puncture the 
surface, etc. All these transformations preserve the stable equivalence class 
of the curve. As an exercise, the reader may show that any two pointed 
curves on the 2-sphere are stably equivalent. The same is true for curves on 
the 2-torus. Pointed curves on surfaces of higher genus are not necessarily 
stably equivalent. The classification of stable equivalence classes of pointed 
curves is an interesting topological problem. 

We note here three geometric invariants of pointed curves preserved under 
the stable equivalence: the minimal crossing number, the genus, and the 
virtual number. The minimal crossing number ||c|| of a pointed curve c is 
the minimal number of crossings of a pointed curve stably equivalent to c. 
The genus g{c) is the minimal integer (7 > such that c is stably equivalent 
to a pointed curve on a closed surface of genus g. To define the virtual 
number of c, note that any pointed curve on with a distinguished set 
of "virtual" crossings represent a stable equivalence class of pointed curves. 
One simply does not look at the virtual crossings or, equivalently, trades 
a branch of c near each virtual crossing for a branch going along a small 
1-handle attached to and avoiding the rest of the curve. The virtual 
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number v(c) is the minimal integer v >0 such that there is a pointed curve 
on with v virtual crossings representing the stable equivalence class of c. 
It is clear that v{c) > g{c). 

Denote by C the set of stable equivalence classes of pointed curves. The 
elements of C are called long flat knots |Kaj or open virtual strings |Tulj . 

We now relate the theory of curves with the theory of nanowords. Con- 
sider the following homotopy data: 

ao = {o, 6}, ro(a) = 6,ro(6) = a, 5o = {(a, a, a), (6, 6, 6)} C a;]. 
Theorem 4.3. There is a canonical bijection 

This theorem shows that the theory of nanowords includes the theory of 
pointed curves as a special case. We outline a construction of the bijection 
C — > AA(ao,5o). Consider a pointed curve on a surface. Label its crossings 
in an arbitrary way by different letters Ai,A2,--- , An where n is the number 
of crossings. The Gauss word of the curve is obtained by moving along the 
curve starting at the origin and writing down the letters as we encounter 
them, finishing when we get back to the origin. The resulting word, w, on 
the alphabet 

A= {Ai,A2,...,An} 

contains every letter Ai, A2, ■ ■ ■ , An twice. We provide A with the projection 
to ao as follows. Consider the crossing of the curve labeled A^. If when 
moving as above along the curve, we first traverse this crossing from the 
bottom-left to the top-right, then \Ai\ = a, otherwise \Ai\ = b; see Figure 
1121 where the orientation of the ambient surface is counterclockwise. The dot 
on the left (resp. right) picture is the bottom-left (resp. bottom-right) entry 
of the crossing. In this way the set A becomes an ag-alphabet. We assign 
to our curve the class of this nanoword in M{ao,So). We must prove that 
stably equivalent curves give rise to 5o-homotopic nanowords. A different 
choice of the labeling of the crossings gives an isomorphic nanoword. If the 
curve is changed by a stable homeomorphism, then the associated nanoword 
does not change, since it is defined entirely by the behavior of the curve in 
its regular neighborhood. A homotopy of the curve may be split into a 
composition of local Reidemeister moves and the inverse moves. Then one 
verifies that under these moves the associated nanoword changes via the 
5o-homotopy moves and the transformations in Lemmas 13.11 and I.S.2I The 
resulting mapping C J\f{ao,So) is bijective, see |Tu3j . 
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1st 



1st 



Figure 12: On the left \Ai 



a and on the right \Ai\ = b 



Under the identification C = J\f{ao,SQ) the minimal crossing number of 
curves corresponds to the length norm on Af{aQ,So). The genus and the 
virtual number yield interesting geometric invariants of nanowords over oq. 

One can suppress all references to the origin in the definitions above. 
This gives a relation of stable homotopy for non-pointed curves. To obtain 
a corresponding notion for the nanowords, one has to introduce an additional 
move on nanowords, the so-called circular shift. Briefly speaking, the shift 
moves the last letter of the word to the first position. For details, see |Tu3j . 

4.2. Knots as nanowords. The constructions of the previous section can 
be upgraded to the setting of knot diagrams. By a (pointed) knot diagram 
we mean a (pointed) curve on an oriented surface with additional data at 
each crossing: one of the branches lies "over" and the other one lies "under" . 
A pointed knot diagram on is shown on Figure 1181 



Two pointed knot diagrams are said to be stably homeomorphic if there 
is an orientation-preserving homeomorphism of the regular neighborhoods 
of the underlying curves, sending the first diagram onto the second one and 
preserving the origin, the orientation, and the over-/under-crossing data. 

Recall the stable equivalence of knot diagrams from |KKj . |CKSj . 

Definition 4.4. Two pointed knot diagrams are stably equivalent if they can 
be related by a finite sequence of the following transformations: 

(1) Stable homeomorphism. 




Figure 13: A pointed knot diagram 
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(2) The Reidemeister moves on a knot diagram in its ambient surface 
away from the origin. 

The moves in (2) may push a branch of the diagram above or below a 
double point or another branch but not across the origin. Thus, the origin 
may not lie inside the neighborhoods where the Reidemeister moves are 
performed. 

Let /C denote the set of stable equivalence classes of pointed knot dia- 
grams. Elements of /C are called long virtual knots, see |Kaj . |GPVj . 

The set /C includes the set of isotopy classes of classical knots. Classical 
knots are oriented knots in S^. There is a map 

{Classical knots} /isotopy ^ JC 

obtained by picking an arbitrary diagram of the given classical knot, picking 
an arbitrary origin on the diagram (not a crossing), and taking the stable 
equivalence class of the resulting pointed knot diagram. This gives a well- 
defined mapping from the set of isotopy classes of classical knots into /C. 
This mapping is known to be injective, see |Kaj . |GPVj . 

The set /C can be interpreted in terms of nanowords as follows, Set 

• a* = (a_|_, o_, 6+, 

• : a* ^ a=K the involution defined by r*(a+) = 5_ and T*(a_) = 6+, 

• cS^ = {(a±,a±,a±), (a±,a±,a=F), {a,^,a±,a±), 
{b±,b±, 6±), {b±,b±, 6zp), (6=p, 6±, b±)} C a^. 

Theorem 4.5. There is a canonical bijection 

IC AA(a*,5*) 
such that the following diagram commutes: 

/C — - — > A/'(q!=„, 5*) 

C AA(ao,5o). 
Here the map IC ^ C is given by forgetting the over- /under-crossing data, 
and the map Af{a^,S^) —i- AA(aoi'5o) is given by a± i— > a, 6-t i— > b. 

This theorem shows that the theory of nanowords includes the theory of 
long virtual knots as a special case. The definition of the bijection /C — > 
AA(a*,5=K) goes similarly to the one for curves. The difference is that now 
we project to a* rather than to ao- The rule is shown on Figure ITU 
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Figure 14: From left to right: 

\Ai\ = a+, \Ai\ = \Ai\ = a_, and \Ai\ = 6_ 

To extend these ideas to hnks, one has to involve phrases, i.e., sequences 
of words, see fTCa) . [TuH. 

4.3. An extension to general a. Let a be an arbitrary alphabet with 
homotopy data r, 5. Nanowords over a can be geometrically interpreted 
as follows. Pick a mapping /: a — > ao such that /r = tq/ and /(a) = 
f{b) = f{c) for any triple (a, b, c) S 5. Every nanoword {A, w: n — > a) over 
a determines a nanoword = {A, fw: n ao) over ao- The latter 

can be represented by a pointed curve on a surface. Thus, w gives rise to 
a family of pointed curves {/^(w)}/ underlying w and numerated by / as 
above. Geometric invariants of these curves provide geometric information 
about w. Stable equivalence classes of these curves depend only on the 
5-homotopy class of w. 

This geometric representation of w seems to be especially efficient in the 
case where S is the diagonal of so that the conditions on / reduce to 
the equivariance relation /r = tq/. One interesting question: when does 
a given family of stable equivalence classes of pointed curves numerated by 
equivariant maps a — > ao arise from a nanoword over a ? 

A similar geometric interpretation of nanowords in terms of knot diagrams 
can be obtained by replacing qq with a*. 

5. Invariant 7 and self-linking of nanowords 

In this section and in the sequel, the symbol a denotes a (fixed) alphabet 
with involution r : a ^ a. 

5.1. The set S. For the rest of this paper, we set 

S = diagonal = {(a,a,a)}^g^ 

The third homotopy move is xAByACzBCt ^ xyz provided |^| = \B\ = 

\C\. In the sequel, we leave S out of notation. By homotopy of nanowords, 

we mean iS-homotopy when iS is the diagonal as above. The homotopy 
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relation is denoted ~. The case of knots is excluded by this choice of S, but 
the case of curves is covered. 

We give now an example of homotopic words. Pick a € q such that 
r(a) 7^ a. Set b = T{a). Consider the words: 

wi = aabab, W2 = babaa, W3 = baaab 

and the nanoword 

= {{A,A'},AA'AA') where |^| = \ A'\ = a. 
Claim, ~ tt;2 — ~ W4 

Proof. We prove only that wi W4. The proofs that W2 — W4 and W3 ~ W4 
are similar. 

The desingularization of wi gives 

wf = ^1,2^1,3^1,2^2,3^^1,3^3^ 

where |^i,2| = |^i,3| = 1^2, si = cl and \B\ = b. By Lemma l3.21 we can strike 
out the two occurrences of ^2,3-6. This gives the nanoword ^41,2^1,3^1,2^1, 3 
isomorphic to Zi;4. □ 

This example shows that the relation of homotopy is quite non-trivial. 

5.2. A group theoretic homotopy invariant. We construct here a ho- 
motopy invariant of nanowords, 7. First, define a group 11 by generators 
and relations: 

n = {{Za}a(^a I ^aZr{a) = 1 for all O G o). 

Note that if a 7^ ''"('^)) then we have a free generator Za = (2T(a))~^ of 11, 
and if a = T(a), then we have a generator Za of order 2. 

For a nanoword {A, w: n ^ ^) of length n, we define n elements 71, 72, 
. . ., 7„ of n by: 

_ f ^|«,(i)h ^(*) ^ ^(•?') ^ < 

\ otherwise. 

Here w{i) G ^ is the i-th letter of w and \w{i)\ € a is its projection to a. 
The sequence 71,72, ... ,7n may be also described as follows. Since w is a 
nanoword, each letter of A appears twice in the sequence w{l),'w{2), . . . , w{n) 
The first time it appears we write at this place the corresponding generator 
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of n. At its second appearance, we write down the inverse of that generator. 
This procedure gives the sequence 71, 72, . . . , 7^- Set 

7{w) = 7172 • • • 7n G n. 

Since each generator appears in this product twice with opposite powers, 
the abcUanization of ^{w) is zero. Thus, 7(1^) hes is in the commutator 

subgroup [n, n] c n. 

For example, consider the nanoword w = ABAB with \A\ = a ^ a and 
\B\=he a. Then -i{w) = ZaZ^z'^z'^ G [n,n]. 

Theorem 5.1. ^{w) is a homotopy invariant of w. 

Proof (outline). Under the first homotopy move, ^{xAAy) = ^{xy) because 
the first appearance of A contributes Za and the second appearance of A 
contributes z~^ where a = |^|. So we have the invariance under the first 
homotopy move. The other two moves are treated similarly. □ 

The mapping 

7: N{a) =M{a,s) [n,n] 

is a monoid homomorphism {S is the diagonal). It is easy to check that it 
is surjective. 

The group [11, 11] can be shown to be free for any r. If r has at least two 
orbits, then this group is non-trivial and by the results above, N{a) is an 
infinite monoid. If r has at least three orbits, then [11,11] has rank > 2, and 
by the results above, M{a) is a non-abclian monoid. 

Let us consider two examples where 7 does not work. The interesting 
case in topology is the case of curves, where a = {a, h} with T(a) = h. In 
this case: 

n = {Za,Zi, I ZaZb = 1) = Z 

and [n, n] = 0. So, for topology the invariant 7 is of no interest. Another 
example: a = {a} with r(a) = a. In this case 11 = Z/2Z and [H, H] = 0. 

5.3. Self-linking of nanowords. We introduce here another homotopy in- 
variant of nanowords, the so-called self-linking. We begin with the following 
observation. Consider the words ABAB and A ABB. The letters A,B are 
obviously linked or interlaced in the first word and unlinked in the second 
one. Consider now an arbitrary nanoword {A, w) over a. We say that two 
letters A,B & A are w interlaced if 

w = ...A--B--A--B-- or w = ■ ■ ■ B ■ ■ ■ A- ■ ■ B ■ ■ ■ A- ■ ■ . 
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In the first case set nju{A, B) = 1 and in the second case set B) = —1. 

In all other cases set nw{A,B) = 0. The function is skew-symmetric in 
the sense that for all A,BgA, 

n^,{A, B) = -nyj{B , A) and n^{A, A) = nw{B, B) = 0. 

Now consider the abelian group vr = 11/ [11, 11]. The group operation in vr 
will be written multiplicatively. Each generator Za £ll with a £ a projects 
to an element of n denoted a. Thus, 

— {{'^}aea \ 0-b = ba and ar(a) = 1 for all a,b £ a). 

For a nanoword ( A, w) over a and every A £ A, set 

[Au = n isr-(^'^) G TT. 

BeA 

For a £ a, set 

Ag^, \A\=a 

The function a i-^ Ztt, a i— s- [aj^; is called the self-linking of w. The following 
theorem derives from this function a homotopy invariant of w. 

Theorem 5.2. 

(1) For any a £ a, the difference [a]u, — [T(a)]^ £ Zir is a homotopy 
invariant of w. 

(2) If T{a) = a, then [a] ^ mod 2 £ (Z/2Z)7r is a homotopy invariant 
ofw. 

For a proof, we refer to |Tu2j . By this theorem, the letters of a give rise 
to homotopy invariants of nanowords over a. These invariants reflect the 
linking of letters in nanowords. Here is a simple application of this invariant. 

Pick a,b £ a and consider the nanoword w = Wa,b = ABAB with 1^41 = a 
and \B\ = b. By Lemma 13.21 if a = t(6), then w is contractible. We 
can use the invariant 7 and the self-linking invariant to show the converse: 
if w is contractible, then a = t(6). Indeed, suppose that o 7^ We 
have ^{w) = ZaZhZ~^z^^. If a / 6, then a, 6 lie in different orbits of r and 
therefore Za does not commute with z^ in 11. Then "f{w) ^ 1 and w is 
non-contractible. If a = 6 7^ ''"(o), then "f{w) = 1. However, in this case 
1^1 = \B\ = a = b and 
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Then [a]^ = a + a^-*^ 7^ € Ztt. Since there are no letters in {A, B} 
projecting to Tq, we have [T(a)]^ = 0. These computations and the previous 
theorem imply that w is non-contractible. 

5.4. Applications of the self-linking. Recall the norm on the nanowords 
\\w\\ = -(minimal length of a nanoword homotopic to w). 

We can use the self-linking to estimate this norm from below. The idea is 
that elements of a group ring may be treated like polynomials and for a 
polynomial we can consider its degree. When w is not too long, there are 
not so many factors in the self-linking invariant, and the degree can not be 
too big. Instead of stating here general theorems, we give an example of the 
resulting estimate for a specific nanoword. Consider the monoliteral word 

a"^ = aa - ■ ■ • a 

formed by m copies of a letter a G a. If T(a) = a or m = 1,2, then this 
word is contractible. If a 7^ r(a) and m > 3, then the self-linking invariant 
gives us the following estimate: 





> 


m' 


X 


m — 1 




.2". 




2 



+ 1 



where [x] denotes the greatest integer which is smaller than or equal to x. 
In particular, has a positive norm and is non-contractible. The estimate 
of 1 1 a"* 1 1 given above is very rough. I suppose that it gives approximately 
twice the actual value of the norm. My conjecture is that 

_ m{m - 1) 
ll« II- 2 • 

Theorem 5.3. Let a,b G a such that a ^ r(a) and b 7^ t(6). The words a"* 
and 6" with m, n > 3 are homotopic if and only if a = b and m = n. 

So such monoliteral words are not homotopic unless they coincide. The 
proof goes by comparing the self-linking invariants. 

5.5. Geometric interpretation of the self-linking. We give a geometric 

interpretation of the self- linking in the case of curves, that is in the case 

where a consists of two letters a, b permuted by r. The group n is then the 

infinite cyclic group with generators a, b satisfying ab = 1. 

Consider a curve c on an oriented surface with crossings Ai, A2, An. 

Each crossing A^ gives rise to two sub-curves of c as follows. Start from A^ 
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and go along c in the positive direction until coming back to Ai for the first 
time. The resulting closed curve is a sub-curve of c. The two branches of c 
passing through Ai give rise in this way to two sub-curves of c. One of them 
passes through the origin of c, we call this sub-curve the thin curve. The 
other, complementary sub-curve is called the thick curve, cf. Figure 1131 




Figure 15: Thin and thick curves associated with a crossing Ai 

Consider the homological intersection number: 

ki = thin curve • thick curve G Z. 

Recall that the homological intersection number of two curves on an oriented 
surface is obtained by deforming the curves into a transversal position and 
then counting their intersections with appropriate signs. 

Consider the nanoword ({Ai, ...,An},uj) corresponding to the curve c. 
Then for all i = 1, n, 

[Ai]u, = lAil''^ G TT. 

This formula gives a geometric interpretation of the symbol [j4j]u,. The self- 
linking is obtained by taking the sum of these symbols over the crossings 
Ai with fixed projection to the alphabet {a,b}. To ensure the invariance 
under the first Reidemeister move, we restrict the summation to Ai such 
that ki ^ or, equivalently, [Ai]^ ^ 1. 

6. Linking pairings of nanowords 

The invariants defined so far, 7 and the self-linking, are insufficient to 

classify even short words. We need more invariants. One idea is to consider 

again the geometric situation of nanowords associated with curves. With 

each crossing Ai we associated a 'thick curve' on the ambient surface. We 

can consider the intersection numbers of these curves with each other. This 

gives an n X n integral matrix where n is the number of crossings and the 
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entry is the intersection number of the thick curves determined by Ai 
and Aj. This matrix can be computed directly from the nanoword. This 
leads us to so-called linking pairings of nanowords. 

In this section, the symbol vr denotes the multiplicative abelian group 
associated with a,r in Section ESI 

6.1. a— pairings. We begin with purely algebraic definitions, whose connec- 
tion to nanowords will be explained later. 

Definition 6.1. An a-pairing is a tuple consisting of a set S, a distin- 
guished element s £ S, a mapping S — {s} — s- a, and a skew-symmetric 
pairing b: S x S ^ it. 

By skew-symmetric, we mean that b{A, B) = b{B,A)^^ for all A, B £ S 
and b{A, A) = 1 for ah AeS. 

An a-pairing can be shortly written as 

{S,s,b: 5 X 5 ^ vr). 

The mapping S — {s} — > a will be encoded by saying that the set S — {s} 
is an a-alphabet. The image of any A £ S — {s} under this mapping will 
be denoted \A\. 

The notion of isomorphism for a-pairings is defined in the obvious way. 
Given an a-pairing {S, s,b: S" x 5 — > vr), we define annihilating elements 
of S and twins as follows. 

Definition 6.2. An element A £ S — {s} is annihilating if b{A, C) = 1 for 
all C eS. 

Definition 6.3. Elements A,B £ S - {s} are twins if b{A, C) = b{B, C) 
for allC £S and \A\ = t{\B\). 

An a-pairing is primitive if it has no annihilating elements and no twins. 
For example, the trivial a-pairing {S = {s}, b{s,s) = 1) is primitive. 
We introduce two moves Mi, M2 on a-pairings. 
Mi: Delete an annihilating element. 
M2: Delete a pair of twins. 
The moves Mi, M2, the inverse moves M^^ , M2^, and isomorphisms of a- 
pairings generate an equivalence relation on the class of a-pairings, called 
homology. The following theorem classifies a-pairings up to homology. 

Theorem 6.4. Every a-pairing is homologous to a primitive a-pairing. 
Two homologous primitive a-pairings are isomorphic. 
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Thus, in each homology class of a-pairings there is a primitive one unique 
up to isomorphism. Starting with an arbitrary a-pairing, we can delete 
annihilating elements and twins and get a primitive a-pairing. The latter 
is uniquely determined by the homology class of the original a-pairing at 
least up to isomorphism. 

6.2. From nanowords to a— pairings. The connection between a-pairings 
and nanowords is this: to each nanoword w over a we shall assign an a- 
pairing bw Its homology class will be a homotopy invariant of w. 

Let {A,w: n — s- .4) be a nanoword over a. Set S = {s} U A- We have a 
projection S — {s} = A ^ a. The skew-symmetric pairing bw' S x S ^ tt 
is defined in four steps. 

Step 1. For every ^ G ^, we can write uniquely w~^{A) = {iajJa} C n 
where iA_ < Ja ■ Thus iA is the position in which the letter A appears 
in w for the first time, and Ja is the position in which the letter A 
appears in w for the second time. Thus w is of the form: 

w = ■■■ A--- A--- . 

iA JA 

Step 2. Given two letters D,E E A, set 

DoE= Y\ |F|G7r. 

iD<iF<3D and iE<jF<jE 

Step 3. The w -linking oi D,E ^ A\s defined by 

\\iyj{D,E) = {D o E){E o D)-^ GTT. 

Step 4. Finally, the form 6^ : S x S —>■ tt is defined as follows: biu{s, s) = 1, 

bUA,s) = [AU= n l^r^^'^^GTr, 
BeA 

by,{s,A) = {[A]^)-'^ e TT, 

by,{A,B) = {lK{A,B)f ^ 

for any A,B e A = S - {s}. 

The following theorem justifies this definition and relates the homotopy 
of nanowords to the homology of a-pairings. 

Theorem 6.5. Homotopic nanowords have homologous a-pairings. 
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This theorem together with Theorem 16.41 give an efficient method to 
distinguish nanowords. Given a nanoword w, we first compute the associated 
a-pairing b^j and then apply the moves Mi and M2 to get a primitive a- 
pairing. The isomorphism class of the latter is a homotopy invariant of w. 

6.3. Applications. One application of the a-pairings is the following es- 
timate of the length norm of nanowords: if (S'+,s,6+ : 5+ x 5+ ^ vr) is a 
primitive a-pairing homologous to the a-pairing (5, s, 6^) of a nanoword w, 
then 

\\w\\ > card(S'+) — 1. 

Indeed, if w is homotopic to a nanoword w' of length 2m, then the a-pairing 
(S", s, b') of w' is homologous to (S, s, bw) and therefore reduces by the moves 
Mi,M2 to the same primitive a-pairing (S^, s,b^). Hence 

m = card(S'') — 1 > card(S'+) — 1. 

In particular, if the a-pairing (S,s,bw) is primitive, then w has minimal 
length in its homotopy class. 

The a-pairing {S,s,b^) can be used to estimate the geometric genera 
of surfaces carrying the underlying curves of w. Pick an equivariant map 
/: a ^ ao where ao = {a,b} is the 2-letter alphabet with involution per- 
muting a,b. The nanoword f^{w) over ao (defined in Section corre- 
sponds to a pointed curve on a compact surface. We can estimate the genus 
g of this surface hy g > (1/2) rank (M) where M is the skew-symmetric 
integral matrix obtained from the matrix {bw{si, S2)}si,s2eS by the group 
homomorphism vr — > Z sending the generators of vr belonging to f~^{a) to 
1 G Z and the generators of tt belonging to f~^{b) to —1 G Z. This estimate 
follows from the geometric interpretation of the ao-pairing of f^{w) in terms 
of the intersection numbers of curves. 

Another area of applications of a-pairings is the homotopy classification 
of nanowords. With the help of a-pairings we can establish the following 
theorem. Recall the nanoword Wafi = ABAB, \A\ = a, \B\ = b defined for 
any a,b a. As we know, Wa^b is non-contractible if and only if a 7^ ^ib)- 

Theorem 6.6. Two non-contractible nanowords Wa,b Oind Wa',b' with a, b, a' , 
b' ^ a are homotopic if and only if a = a' and b = b' . 

Using the a-pairings and the invariants introduced in further sections, we 
establish the following theorem. It gives a complete homotopy classification 

25 



of words of length 5 in which one letter, a, occurs 3 times, and another 
letter, b, occurs 2 times. 

Theorem 6.7. Let a,b be two distinct letters of the alphabet a. Then: 

(1) The words aaabb, aabba, abbaa, bbaaa are homotopic to each other; 
they are contractible if and only if T{a) = a. 

(2) The word baaab is contractible if and only if T{a) = a. 

(3) The word ababa is contractible if and only if T{a) = b. 

(4) The words abaab, baaba, aabab, babaa are never contractible. 

(5) A non- contractible word from (2) - (4) is never homotopic to a word 
from (1). 

(6) Two non- contractible words from (2) ~ (4) are homotopic to each 
other if and only if they coincide letterwise (i.e., if and only if they 
are the same word written twice) with the following exceptions: 

aabab ~ babaa ~ baaab for r(a) = b. 

A more general homotopy classification of all words of length < 5 is given 
in |Tu2j . We can think of such classification theorems as analogues of knot 
tables. First we draw all possible knot diagrams and then decide which 
diagrams represent isotopic knots. The same kind of problem arises for the 
homotopy of words. 

6.4. Examples. 1. We show how to compute the a-pairing associated with 
the word w = abaab where a ^ b. We have 

w'^ = A3A2BA3A1A2A1B, \Ai\ = IA2I = l^sl = a, \B\ = b 

where to simplify notation we write Ai,A2,A3 for ^2,3)^1,3,^1,2 respec- 
tively. The matrix for nyj is computed by 

"0-1 " 
10-11 
10 1 
0-1-1 _ 

where the rows and columns correspond to ^1,^2,^3,-B respectively. The 
matrix for Ik^ is computed by 

"11 fl-i 1 " 

11 la 

a 1 1 a 

_ 1 a-^ a-^ 1 _ 
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where the rows and columns correspond to Ai, A2, A^, B, respectively. Fi- 
nally, we compute the a-pairing 6^: 



1 


a 




a-'b-' 






1 






1 


b 


a2 


1 






ab 


a2 


a2 


1 




a-2 


1 






1 



where the rows and columns correspond to s,Ai,A2,A3,B, respectively. 
Recall that tt = {{cjcea \ cr(c) = 1). In particular a and b are non- 
trivial elements of tt. This implies that the elements Ai and A2 of S = 
{5,^1,^2,^3,5} are non-annihilating. If A^ is annihilating, then = 
ab = 1. Then a = b, which contradicts the assumption a ^ b. A similar 
argument shows that B is non-annihilating. It is also easy to check that bw 
does not have twins. Thus, the a-pairing bw is primitive. Therefore w is 
non-contractible, and \\w\\ =4. 

2. The a-pairings are strong enough to distinguish short words and 
nanowords in many cases. The following example shows however that in 
some cases the a-pairings are powerless. 

Consider the word w = ababa where r(a) = a ^ b = T(b). A direct 
computation shows that the a-pairing of w'^ is given by the following matrix 
over tt: 



1 


ab 


1 


a-^b- 


1 


1 




1 




a-%- 


2 




1 




1 


a-%- 


2 


1 


ab 


a^^ 


a%^ 


1 




ab 


1 


ab 


1 




1 


1 



As above, the rows and columns correspond to s,Ai,A2,A2„B respectively. 
The equality a = r(a) implies that = 1 and similarly b^ = 1. Therefore 
the matrix above simplifies to the following matrix: 

1 ab 1 ab 1 
a6 1 1 1 a6 

11111 . 
a6 1 1 1 a6 

1 ab 1 ab 1 

Since the third row and the third column consist only of I's, the element A2 
is annihilating. Eliminating it, we observe next that Ai and A3 are twins. 
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What remains after their ehmination are two elements s and B. The matrix 
becomes as follows: 

1 1 
1 1 

where the rows and columns correspond to s and B. Now B is an anni- 
hilating element. Its elimination gives the trivial a-pairing. Therefore the 
a~pairing associated with w'^ gives no information at all and does not allow 
to decide whether w = ababa is contractible or not. In fact all invariants of 
nanowords considered so far are trivial for this word (under the assumptions 
that r(a) = a / 6 = t(6)). 

This example shows that we need more invariants to prove Theorem 16.71 
We shall introduce further invariants in the next sections. 

7. Further invariants of nanowords 

We outline here several ideas inspired by knot theory and leading to ho- 
motopy invariants of nanowords over a. 

7.1. Tricolorings. In knot theory one may treat any knot diagram as con- 
sisting of disjoint arcs. A coloring of the diagram assigns a residue mod 3 to 
each arc, such that for every crossing, the sum of the three residues assigned 
to the adjacent arcs is equal to 0. The number of such colorings of a knot 
diagram is a knot invariant. This definition is due to R. Fox. 

We can introduce similar definitions for words. Fix a set /3 C a such that 
r(/3) = /? (the resulting invariants may depend on Consider a nanoword 
w = {A, w : n ^ A) over a. For any letter A £ A, let za < Ja be the first 
and the second indices enumerating the positions A in w as in Section 
IH.2L A tricoloring of if is a function /: {0, 1,2, ... ,n} Z/3Z such that 
for any A £ A, if \A\ € f3, then 

f{iA) = f{iA-l) and f{jA-l) + f{jA) + f{iA) = 

and if 1^41 G a — /?, then 

f{jA) = fijA-l) and f{iA-l) + f{iA) + f{jA) = 0. 

The residues /(O) and f{n) are called respectively the input and the output 
of/. 

Tricolorings of w may be alternatively described as follows. We first write 
w with dashes between consecutive letters: 



-u;(l) -w{2) - 
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— w{n) — 



Enumerate the dashes from left to right by the numbers 0,1, ... ,n. Then 
the function / as above can be seen as an assignment of a residue mod 3 to 
every dash. The conditions above mean that for any A £ A, the coloring 
has the following form near the two entries of A in w: if \A\ £ [3, then it 
looks like 

c , c c' , c" 

A A 

with c + c' + c" = and if | A| G /? — a, then it looks like 

c' , c" c . c 

A A 

with c + c' + c" = 0. The input of the coloring is the residue assigned to the 
leftmost dash and the output is the residue assigned to the rightmost dash. 

For example, the function assigning one and the same residue to all dashes 
is a coloring. It is called the trivial coloring. 

Theorem 7.1. For any k, I G Z/3Z, the number of tricolorings of a nanoword 
w with input k and output I is a homotopy invariant of w. 

Note that this number may depend on k,l, and on the choice of (3. We 
can easily compute this number for the empty nanoword (it has only one 
dash), li k = I, then this nanoword admits one coloring with input k and 
output /. If k ^ I, then there are no such colorings. By Theorem 17.11 the 
same is true for any contractible nanoword. 

Now we give an example of a nanoword which admits a coloring with 
distinct input and output. Consider the nanoword 

VU = A1A2BA3A1BA2A3, 

where \Ai\ = \A2\ = {A^l = a G a, \B\ = b £ a such that a and b lie 
in different orbits of r. Set /? = {a,r(a)} C a. The nanoword w has the 
following non-trivial coloring with input and output 1: 

0< 0. 0„1. 1. 2„2. 1. 1 

— Ai— A2^ B — A3 — Ai~ B — A2~ A3 — . 
By the remarks above, this nanoword is non-contractible. 

7.2. Module of a nanoword. A related but stronger invariant of knots 
is the Alexander module. It can be computed from a knot diagram via an 
explicit presentation by generators and relations. We apply a similar idea 
to nanowords. First, we introduce the group 

= {{a,a,}aea \ aa, = a,a, aT{a) = 1, a,T(a), = 1). 
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We already considered groups 11 and vr given by similar presentations. In 
^, each letter a € a gives rise to two commuting generators a and a,. In 
the case of knot diagrams on surfaces, this phenomenon of doubling of the 
number of generators was already observed by Sawollek |Saj who studied 
generalizations of the Alexander polynomial, see also |SWlj . 

Let A = Z^' be the integral group ring of ^. This ring will play the role 
of the ground ring for our modules. 

Fix a set /3 C a such that t(/3) = /?. Consider a nanoword w = 
{A, w: n ^ A) over a. We derive from w & {n + 1) x n matrix over A 
whose rows are numerated by the dashes of w. Each letter A ^ A gives 
rise to two rows. To write them down, set a = |A| G a and assume that A 
appears in w for the first time at the i-th position and for the second time 
at the j-th position. If a € /?, then the two rows determined by A are 

i 3 

- A - ■■■ - A - 

•••0 a -1 0---0 O--- 

•••0 l-aa, 0---0 a, -I O--- 

If a G Q — /?, then the two rows determined by A are 

« i 

- A - ••• - A - 

•••0 0---0 a -1 O--- 

•••0 a, -I 0---0 l-aa, O--- 

All unspecified entries of the rows are 0. The resulting n x (n + 1) matrix 
over A determines a A-homomorphism ^ : A" ^ A"+^ whose cokernel 

Kp{w) = A^+VV'IA"). 

is a A-module. This module has distinguished elements: the "input" v- and 
the "output" 1"+ represented by the leftmost dash and the rightmost dash, 
respectively. 

Theorem 7.2. The triple {Kfj['w),V-,Vj^) considered up to isomorphism is 
a homotopy invariant of w. 

One can derive further homotopy invariants from the triple {Kj3{w),v^ , 
or directly from the presentation matrix of Kp{w) introduced above. For 
example, one may remove the first (or the last) column and consider the 
resulting n x n matrix over A. Then we can take its determinant over the 
ring A"'' obtained by abelianization of A. Following this line of thought and 
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with a little more work, one obtains two "polynomial" homotopy invariants 
of w belonging to A*^^ (see |Tii2j for details). They are denoted V^(if;) and 
Vt{w) and satisfy the following duality: 



and a, = T(a), for all a £ a. 

7.3. The invariant A. We now focus on the case where (3 = a. This will 
lead us to a homotopy invariant A of nanowords taking values in the ring A. 
This invariant is a generalization of an invariant introduced by Silver and 
Williams |SW2j for curves. 

Consider the A-module Kp{w) = Ka{w) associated above with a nanoword 
w: n ^ A. Let xo, a^i, . . . , x„ be the generators of Ka{w) given by the 
dashes. Each letter A £ A gives rise to two relations 



where a = |^| G a and i = ia < j = JA are the elements of w~^[A). Each of 
these relations expresses a generator via the previous generators. Therefore 
Ka{w) is a rank one free A-module generated by the input v- = xq. The out- 
put vj^ = Xn £ Ka{w) has the form = X'v- for a unique A' G A. Theorem 
17.21 implies that A' = A'(tu) is a homotopy invariant of w. This invariant is a 
non-commutative polynomial. It admits an equivalent but more convenient 
version X{w) defined as follows. Consider the involutive anti-automorphism l 
of A keeping fixed all the generators {a, a,} of A. Thus, l acts on monomials 
by reading them from right to left. For instance i{aab,) = b,aa. Set 



We describe a method allowing to compute Xiw) and generalizing a method 
due to Silver and Williams |SW2j in the context of curves. We do it here 
for a few examples, the general method |Tu2j should be clear. 

Example 7.3. Consider the nanoword w = ABAB, \A\ = a € a, \B\ = b £ 
a. First draw the following graph: 




Xj+i = axi, Xj = a,Xj-i -|- (1 — aa,)xi 



I) 



Xiw) = l{X'{w)) e A. 



a 



b 



a. 







1 



2 



3 



4 



— A — 



B 



— A — 



B — 
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Each vertex of this graph corresponds to a dash in w and each edge corre- 
sponds to a letter in w. Recall that every letter appears twice. The edge 
corresponding to the first (leftmost) appearance of A is labeled with a; the 
edge corresponding to the second (rightmost) appearance of A is labeled 
with a,. Connect the left vertex of the first edge with the right vertex of 
the second edge by an arc in the upper half-planc and label this arc with 
1 — aa, G A. Do the same for the letter B replacing everywhere a = \A\ by 
b = \B\. The resulting picture is drawn on the next figure. 




Consider all paths starting at the input and going to the output along 
the edges and arcs, always from left to right. We record the elements of 
A labeling the arcs and edges on the path and multiply them following the 
order determined by the path. The polynomial A(u;) is obtained as the sum 
of the resulting elements of A over all paths. In this case there are three 
such paths: 

(1) The path — 1 — 2 — 3 — 4 contributes aba,b,. 

(2) The path - 1 4 contributes a(l - bb,). 

(3) The path 0^3 — 4 contributes (1 — aa,)b. 

Then 

X{w) = aba,b, + a(l — bb,) + (1 — aa,)6,. 

The ring A has a natural grading as follows. Recall that 

A = Z[o, a.jaga/aa, = o,a, aT(a) = 1, a,r(a), = 1. 

The defining relations are homogeneous with respect to degrees mod 2. 
Therefore 

A = Ao,o©Ao,i®Ai,oeAi,i, 

where Aij is generated by monomials in which generators without bullets 
appear i times mod 2 and generators with bullets appear j times mod 2. 
Every A G A expands uniquely as the sum 



A — Ao,o + Ao,i + Ai^o + Ai,i, 
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where Ajj € Aj for all For w = ABAB, this expansion of X{w) gives: 

>^o,oiw) = aba,b,, 
^o,i{w) = —abb, + b,, 
Ai,o(f«) = a- aa,b,, 
\i,i{w) = 0. 

These computations allow us to give another proof of the fact that w is 
contractible if and only if a = t(6). Indeed, if w is contractible, then X{w) = 
1 and hence Ao,i(u^) = 0. This implies that abb, = b,. Hence ab = 1 and 
a = T{b). 

Example 7.4. We apply A to the word ababa with a = r(a) 7^ 6 = r(6). As 
we saw above, the corresponding a-pairing gives no information about the 
homotopy properties of w. By definition, X{w) = X{w'^). The desingulariza- 
tion of w is the nanoword 

w'^ = A3A2BA3A1BA2A1, \Ai \ = \A2\ = {Asl = a, \B\ = b. 

To compute X{w), we draw the following graph: 




Then 

X{w) = (1 - aa,)2 

+ (1 — aa,)ab,a1 ^ 

+a(l — aa,)a, — ^ — 

W{l-bb,)al 

+a^ba,{l — aa,) ^ 

+a^baa,b,al. 

The assumptions a = r(a) and b = t(6) imply that a^ = b^ = al = bl = 1. 
After simplification, we obtain that 



-^0,0 (^) = 2 — ba — a,b, + baa,b, 
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If Ao,o(i^) = 1) then one of the two elements ba and a,6, of the group ^ 
must be equal to 1. This is possible only if a = r(6) = b, which contradicts 
the assumptions. So, Xofl{w) 7^ 1 and w is non-contractible. 

Example 7.5. Consider the nanowords 

wi = ABACBC, \A\ = \C\ = a, \B\ = T(a), 
W2 = AC AC, \A\ = \C\ = a, 

where a € a satisfies r(a) ^ a. These two nanowords are not distinguished 
by A. In fact, all the techniques described so far fail to distinguish these 
nanowords up to homotopy. This can be done using the methods introduced 
in the next section. 

8. a-KEIS AND WORDS 

8.1. a— keis. Keis were introduced in 1942 by a Japanese mathematician, 
M. Takasaki, see S. Kamada |Kamj for a comprehensive survey of keis, their 
generalizations, and connections with knot theory. A kei is a set X with 
multiplication * which satisfies a few axioms, the main axiom being 

(x * y) * z = (x * z) * {y * z) 

for all x,y,z G X. One may think of x * y as of a kind of conjugation of x 
by y. 

To produce homotopy invariants of words, we introduce a notion of an 
a-kei, where a is a set with involution r. An a-kei is a non-empty set X 
with maps 

X ^ X, X ax and X x X ^ X, (x,y) ^ x *a U ^ X 

numerated by a € a such that the following axioms are satisfied: 

(1) ax *a X = x, 

(2) a{x *a y) = ax *a ay, 

(3) (x *a y)*aZ= {x *a az) *a {V *a z), 

(4) ar(a)x = x, 

(5) [x *a y) *r(a) = X, 

for all a G a and x,y,z G X. Arbitrary a-keis can be presented by genera- 
tors and relations as groups in group theory. 

Example 8.1. Recall the non-commutative ring 

A = Z[a, a,]aGa/aa, = a,a, ar(a) = 1, a,r(a), = 1. 
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Any left A-module X becomes an a-kei with kei operations x i-^ ax and 



x*ay = a,x + (1 - a,a)y. 

The a-keis obtained by this construction are said to be abelian. 

8.2. a— keis of nanowords. The theory of keis can be apphed to produce 
homotopy invariants of nanowords. Fix a set /? C a such that t(/3) = (3. 
For any nanoword {A,w: n ^ A) over a, we define an a-kei )Cp{w). It 
is generated by n + 1 symbols Xq,Xi, . . . ,Xn satisfying the following n 
defining relations. Each letter A e A gives two relations. To write them 
down, assume that A appears in w for the first time at the i-th position and 
for the second time at the j-th position, where i < j. If a = \A\ G P, then 
the relations are 

Xi = aXj_i, Xj = Xj-i *(j 
If a = \A\ e a — l3, then the relations are 

= Xi-i *a ^j-lj = aXj-i. 

The elements V- = Xq e JCp{w) and V+ = Xn € lCi3{w) are called the input 
and the output, respectively. 

The idea behind these fornuilas comes from knot theory. In knot theory, 
every knot diagram gives rise to a so-called quandlc. Quandles arc gener- 
alizations of keis and also have only one operation, the binary operation *. 
The quandle associated with a knot diagram is determined by generators, 
associated with the arcs of the diagrams, and relations, associated with the 
crossings, cf. the picture on the left hand side of the following figure. 



X y X ay 




y x*y y x*ay 

In the setting of nanowords the situation is somewhat different. First, each 
crossing is labeled by a letter, a G a, which allows us to involve the operation 
y ^ ay absent for knots. The binary operation also depends on a. 
Also, the two incoming branches are ordered. This leads us to the defining 
relations as above, whose geometric interpretation is shown on the right 
hand side of the figure. 
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Theorem 8.2. The triple {ICi3{w),V^,V^), considered up to isomorphism, 
is a homotopy invariant of w. 

The A-module Kj^{w)^ viewed as an a-kei, can be computed from K,p{w). 
Namely, there is a homomorphism of a-keis K,p{w) — >■ Kp{w) such that 
for any homomorphism from lCi3{w) to an abehan ct-kei X, the following 
diagram is commutative: 

K.p{w) ^Kp{w) 




X. 



8.3. Characteristic sequences. Consider in more detail the case (3 = a. 
Looking at the defining relations, we easily observe that JCp{w) = K,a{w) 
is a free a-kei generated by the input V-. The output V+ E K.a{w) is a 
homotopy invariant of w. The structure of free a-keis is poorly understood, 
which prevents us from deriving further invariants of w from V+. We focus 
on a special case where more information is available. 

Suppose that the involution r: a — > a is fixed-point-free, that is r(a) ^ a 
for all a € a. Fix a set a+ C a meeting every orbit of r in exactly one 
element. Thus, 

a = a+ U r(a+), a+ n t(q!+) = 0. 

Recall the group ^ introduced in Section [7.21 We show how to derive from 
any nanoword w over a a finite sequence {£itpi,£2''p2, ■ ■ ■ , ^m'4'm) with m > 0, 
V'l , . . . , V'm £ ^5 and ei,...,£m G This sequence is a homotopy 

invariant of w (possibly depending on It determines X{w) by 

m 

X{w) = ^EiiJi G A = Z^-. 

i=l 

In the setting of curves, this sequence was introduced by Silver and Williams 
ISW2|. 

We first define an a-kei F as follows. Let F be the free group generated by 
the group ^, viewed as a set. Each element ip € ^ gives rise to a generator 
of F, denoted ip. In particular, the unit 1 G ^' gives rise to a generator 
1 F which is by no means the unit of F. A typical element of F has the 
form 
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where m > 0, '4'i,ip2, ■ ■ ■ ■, i^m ^ ^ ■, and £i, . . . , € {±1}- Such an element 
is the unit of F if either m = or it can be reduced to the case m = 
by applying the relations 'tp{tp)~^ = {ip)~^'4^ = 1. The left action of ^ on 
itself extends to a group action of on F by group automorphisms. The 
generators a, a, G ^ act on F by 

This defines in particular the mapping F ^ a; ax for all a G a. The 
binary operation x y for x, y G -P is defined by 

x*ay = y{a,x){a,ay)'~^ G F, 

if a G Q;+ and 

x*ay = {ria)~^T{a)-^y)-^{a~^x)y G F, 

if a G a — a+. These operations make F into an a-kei. 

Recall that starting with a nanoword w, we obtained a homotopy invariant 
element of the free a-kei }Ca{w) on one generator V-. Since ICa{w) is 
free, there is a unique a-kei homomorphism / : K-aiw) — > F such that 
f{V-) = 1 e F. Then f{V+) G F is a homotopy invariant of w. We can 
expand 

/(y) = (3^1)^1 • • • (Vv^)^- G F, 

where ijji, . . . ,ijjm S and £i,...,£„i G {±1}- The resulting sequence 

{eiipi, . . . jEmi^m) is well-defined up to insertion or deletion of consecutive 
pairs —tp) and +ip)- Deleting all such pairs, we obtain a uniquely 
defined sequence (eiiIji, . . . ,em'V'm') with m' < m which is a homotopy in- 
variant of w. This is the characteristic sequence of w. 

8.4. Examples. 1. Pick a, 5 G a_|_ and consider the nanoword w = ABAB 
with 1^1 = a and \B\ = h. It is easy to compute from the relations that 
V+ = (haV- *a ^-) *a oV-. The characteristic sequence of w is computed to 
be 

(a, b,,b,a,ba, —b,a,a, —b,ba). 

In particular, if a = 6, then this sequence is (a, a,, a^a^, — a^a, — a.aa). 

2. Consider the nanoword wx = ABACBC, \A\ = \C\ = a, \B\ = r(a) 7^ 
a. Its characteristic sequence (determined by any a+ C a as above such 
that a e a+) is: 

(1, a,, — aa,, —1, a, aa», —a^a,, aa», a^a^, —aal, —aa»). 
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Comparing with the previous example (for a = b), we obtain that wi is not 
homotopic to the nanoword W2 = AC AC with |^| = \C\ = a. This result 
was claimed at the end of Section [Tj 

3. One might think that such a powerful invariant as the characteristic 
sequence should distinguish arbitrary non-homotopic nanowords. However 
this is not true, as shows the following example. Pick four letters a,b,c,d G a 
(possibly coinciding) and consider the nanoword 

w = ABCDCDAB, \A\ = a, \B\ = b, \C\ = c, \D\ = d. 

An inspection shows that if a 7^ r(&) and c 7^ ''"(d), then the a-pairing 
of w is primitive. Then \\w\\ = 4 and w is non-contractible. However, a 
direct computation shows that for a,b £ a+ and c = T{b),d = t(o), the 
characteristic sequence of w is the same as the one of the empty nanoword. 
Both consist of a single term 1 € 

9. Open questions and further directions 
Question 9.1. Classify nanowords of length < 10 up to homotopy. 

In |Tu2j we give a homotopy classification of nanowords up to length 6. 
The next step is to handle the nanowords of length 8. Does one need new 
homotopy invariants already for length 8 ? 

Question 9.2. Classify words of length < 7 up to homotopy. 

In |Tu2j we give a homotopy classification of words up to length 5. One 
may try to classify words by first classifying nanowords. However, short 
words may desingularize into quite long nanowords. For example, the word 
aababb, desingularizes into a nanoword of length 12. Still, a classification 
of words of length < 7 does not look unrealistic because they desingularize 
into a quite particular set of nanowords. 

Question 9.3. What (primitive) a-pairings can be realized as a-pairings 
of nanowords? 

Question 9.4. What polynomials A G A arise from nanowords? 

There are some simple known conditions, see |Tu2j . All new conditions 
are welcome. 

Question 9.5. Is it true that all nanowords over the alphabet consisting of 
a single element are contractible ? 
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At the moment, nothing contradicts the conjecture that the answer is yes. 

Question 9.6. Give a normal form for elements of a free a-kei on one 
generator. 

Such a normal form (or at least an algorithm to distinguish elements of 
this a-kei) would help to distinguish words up to homotopy. 

One further direction is the study of cobordisms of words. Cobordism 
is an equivalence relation generated by surgery on words which consists in 
deleting or inserting symmetric subwords or subphrases. There are difficult 
problems concerning the classification of words up to cobordism. This is 
studied in |Tu4j . 

Another interesting direction is a study of higher dimensional words over 
an alphabet a. Knot theory and other topological ideas used above gener- 
alize to higher dimensions. What can be said about similar generalizations 
of words? From the topological perspective, an n-dimensional nanoword 
is an immersion of a connected n-dimensional manifold into an (n -|- 1)- 
dimensional manifold. The double points of the immersion split as a union 
of connected (n — 2)-dimensional manifolds labeled with letters of a. The 
case n = 1 is treated in the present paper. The next case n = 2 is quite 
mysterious. What are the appropriate analogues of the homotopy moves for 
n = 2 ? Although a study of high-dimensional words is tempting, it is hard 
to imagine intelligent beings communicating with such words. 
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